RAW SEQUENCE LISTING DATE: 10/08/2003

PATENT APPLICATION: US/10/090,455A TIME: 14:27:58

Input Set : E:\406.app.txt 
Output Set: N:\CRF4\10082003\J090455A.raw 

4 <110> APPLICANT: Chen, Hongyun 

5 Le Bihan, Stephane 

7 <120> TITLE OF INVENTION: NOVEL ABCG4 TRANSPORTER AND USES THEREOF 
10 <130> FILE REFERENCE: 100103.406 

12 <140> CURRENT APPLICATION NUMBER: US 10/090,455A

13 <141> CURRENT FILING DATE: 2002-03-01 
15 <160> NUMBER OF SEQ ID NOS: 23 

17 <170> SOFTWARE: FastSEQ for Windows Version 4.0

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 3455 

21 <212> TYPE: DNA 

22 <213> ORGANISM: Homo sapiens 

24 <400> SEQUENCE: 1 

25 gccaccatgg cggagaaggc gctggaggcc gtgggctgtg gactagggcc gggggctgtg 60
26 gccatggccg tgacgctgga ggacggggcg gaaccccctg tgctgaccac gcacctgaag 120
27 aaggtggaga accacatcac tgaagcccag cgcttctccc acctgcccaa gcgctcagcc 180
28 gtggacatcg agttcgtgga gctgtcctat tccgtgcggg aggggccctg ctggcgcaaa 240
29 aggggttata agacccttct caagtgcctc tcaggtaaat tctgccgccg ggagctgatt 300
30 ggcatcatgg gcccctcagg ggctggcaag tctacattca tgaacatctt ggcaggatac 360
31 agggagtctg gaatgaaggg gcagatcctg gttaatggaa ggccacggga gctgaggacc 420
32 ttccgcaaga tgtcctgcta catcatgcaa gatgacatgc tgctgccgca cctcacggtg 480
33 ttggaagcca tgatggtctc tgctaacctg aatcttactg agaatcccga tgtgaaaaac 540
34 gatctcgtga cagagatcct gacggcactg ggcctgatgt cgtgctccca cacgaggaca 600
35 gccctgctct ctggcgggca gaggaagcgt ctggccatcg ccctggagct ggtcaacaac 660
36 ccgcctgtca tgttctttga tgagcccacc agtggtctgg atagcgcctc ttgtttccaa 720
37 gtggtgtccc tcatgaagtc cctggcacag gggggccgta ccatcatctg caccatccac 780
38 cagcccagtg ccaagctctt tgagatgttt gacaagctct acatcctgag ccagggtcag 840
39 tgcatcttca aaggagtggt caccaacctg atcccctatc taaagggact cggcttgcat 900
40 tgccccacct accacaaccc ggctgacttc atcatcgagg tggcctctgg cgagtatgga 960
42 aagagcagcc ctgagaagaa cgaggtccct gccccatgcc ctccttgtcc tccggaagtg 1080
43 gatcccattg aaagccacac ctttgccacc agcaccctca cacagttctg catcctcttc 1140
44 aagaggacct tcctgtccat cctcagggac acggtcctga cccacctacg gttcatgtcc 1200
45 cacgtggtta ttggcgtgct catcggcctc ctctacctgc atattggcga cgatgccagc 1260
46 aaggtcttca acaacaccgg ctgcctcttc ttctccatgc tgttcctcat gttcgccgcc 1320
47 ctcatgccaa ctgtgctcac cttcccctta gagatggcgg tcttcatgag ggagcacctc 1380
48 aactactggt acagcctcaa agcgtattac ctggccaaga ccatggctga cgtgcccttt 1440
49 caggtggtgt gtccggtggt ctactgcagc attgtgtact ggatgacggg ccagcccgct 1500
50 gagaccagcc gcttcctgct cttctcagcc ctggccaccg ccaccgcctt ggtggcccaa 1560
51 tctttggggc tgctgatcgg agctgcttcc aactccctac aggtggccac ttttgtgggc 1620
52 ccagttaccg ccatccctgt cctcttgttc tccggcttct ttgtcagctt caagaccatc 1680
53 cccacttacc tgcaatggag ctcctatctc tcctatgtca ggtatggctt tgagggtgtg 1740
54 atcctgacga tctatggcat ggagcgagga gacctgacat gtttagagga acgctgcccg 1800
55 ttccgggagc cacagagcat cctccgagcg ctggatgtgg aggatgccaa gctctacatg 1860
56 gacttcctgg tcttgggcat cttcttccta gccctgcggc tgctggccta ccttgtgctg 1920
57 cgttaccggg tcaagtcaga gagatagagg cttgccccag cctgtacccc agcccctgca 1980
58 gcaggaagcc cccagtccca gccctttggg actgttttaa ccttatagac ttgggcactg 2040
59 gttcctggcg gggctatcct ctcctccctt ggctcctcca caggctggct gtcggactgc 2100
60 gctcccagcc tgggctctgg gagtgggggc tccagccctc cccactatgc ccaggagtct 2160
61 tcccaagttg atgcggtttg tagcttcctc cctactctct ccaacacctg catgcaaaga 2220
62 ctactgggag gctgctgcct ccttcctgcc catggcaccc tcctctgctg tctgcctggg 2280
63 agccctaggc tctctagggc cccacttaca actgaccaaa gtggccccct ctgggggtcc 2340
64 ccaccacaca agtgtttgta aactgggctg ctataaggtt ggagttccag ggctgggccc 2400
65 tggtggagtc cactggaagt cccattatgg atgttgaaat ggacagggaa ggactctgga 2460
66 agtctcttcc tcctcctcct cttctctcca cccctagacc ctggctgact tggacaatct 2520
67 gccaggacag aagctgggtt ttctgtctag gtcaccactc ccaatcctgg ggattggaga 2580
68 ggcctggggc tgtgggatgc cccatccccc tccccatcac ctttggtggg ggcagggcct 2640
69 ggtggcacct gtgcaataat gtctgtgttt ctctcccacc tgccactgga actggagaat 2700
70 gcactttatt ctgggcgggg ggtgagtggg ggaagaccca accctccttt ctcgctgccc 2760
71 ctaacgcatg cacggtctcg tgatgctccc tccctctccg gagtgacagg cacatacatg 2820
72 agaacaggcc atctcagccc tacacacttg ccatccccta cagcacagag gaagagtgat 2880
73 ggtggcatgc tggtggtggc gggtgctggt gggaggacag tgccaacctc ctcctgggga 2940
74 tcccatgttg gagactctaa ggataaggct ggtgctgccc agggtgtcta caggaactgc 3000
75 aggtgtctac ccccaagtct tccctcctcc caagccaggg gtggcacagg gcactagatc 3060
76 cctggagttc aggaaccaac acaagcacaa ccacgggcat aagttggcct tggccactgc 3120
77 cacccacggc cctccttttg tgctccatgc tggcatcttc actcccctac cccttcccca 3180
78 gccactgctg ctcattcaaa cttctgtcca tgtccctcca ctgttcctat cagcaggtgg 3240
79 cccctgggca tcagaacagc ctgccctggg caccaggtgg cagacacact cagagcatgt 3300
80 ctggctttcc tggtgggtcc aggctcattc tgcttctgat ttcccctccc ccagggctca 3360
81 ttttccccct ttttcctgta cacatccctg tctacctcct ctcaccctgc cacagattct 3420
82 tcctatcaca cagggatgcc agttgtattt gtggg 3455

84 <210> SEQ ID NO: 2
85 <211> LENGTH: 646
86 <212> TYPE: PRT
87 <213> ORGANISM: Homo sapiens
89 <400> SEQUENCE: 2
90 Met Ala Glu Lys Ala Leu Glu Ala Val Gly Cys Gly Leu Gly Pro Gly
91 1 5 10 15
92 Ala Val Ala Met Ala Val Thr Leu Glu Asp Gly Ala Glu Pro Pro Val
93 20 25 30
94 Leu Thr Thr His Leu Lys Lys Val Glu Asn His Ile Thr Glu Ala Gln
95 35 40 45
96 Arg Phe Ser His Leu Pro Lys Arg Ser Ala Val Asp Ile Glu Phe Val
97 50 55 60
98 Glu Leu Ser Tyr Ser Val Arg Glu Gly Pro Cys Trp Arg Lys Arg Gly
99 65 70 75 80
100 Tyr Lys Thr Leu Leu Lys Cys Leu Ser Gly Lys Phe Cys Arg Arg Glu
101 85 90 95
102 Leu Ile Gly Ile Met Gly Pro Ser Gly Ala Gly Lys Ser Thr Phe Met
103 100 105 110
104 Asn Ile Leu Ala Gly Tyr Arg Glu Ser Gly Met Lys Gly Gln Ile Leu
105 115 120 125
106 Val Asn Gly Arg Pro Arg Glu Leu Arg Thr Phe Arg Lys Met Ser Cys
107 130 135 140
108 Tyr Ile Met Gln Asp Asp Met Leu Leu Pro His Leu Thr Val Leu Glu
109 145 150 155 160
110 Ala Met Met Val Ser Ala Asn Leu Asn Leu Thr Glu Asn Pro Asp Val
111 165 170 175
112 Lys Asn Asp Leu Val Thr Glu Ile Leu Thr Ala Leu Gly Leu Met Ser
113 180 185 190
114 Cys Ser His Thr Arg Thr Ala Leu Leu Ser Gly Gly Gln Arg Lys Arg
115 195 200 205
116 Leu Ala Ile Ala Leu Glu Leu Val Asn Asn Pro Pro Val Met Phe Phe
117 210 215 220
118 Asp Glu Pro Thr Ser Gly Leu Asp Ser Ala Ser Cys Phe Gln Val Val
119 225 230 235 240
120 Ser Leu Met Lys Ser Leu Ala Gln Gly Gly Arg Thr Ile Ile Cys Thr
121 245 250 255
122 Ile His Gln Pro Ser Ala Lys Leu Phe Glu Met Phe Asp Lys Leu Tyr
123 260 265 270
124 Ile Leu Ser Gln Gly Gln Cys Ile Phe Lys Gly Val Val Thr Asn Leu
125 275 280 285
126 Ile Pro Tyr Leu Lys Gly Leu Gly Leu His Cys Pro Thr Tyr His Asn
127 290 295 300
128 Pro Ala Asp Phe Ile Ile Glu Val Ala Ser Gly Glu Tyr Gly Asp Leu
129 305 310 315 320
130 Asn Pro Met Leu Phe Arg Ala Val Gln Asn Gly Leu Cys Ala Met Ala
131 325 330 335
132 Glu Lys Lys Ser Ser Pro Glu Lys Asn Glu Val Pro Ala Pro Cys Pro
133 340 345 350
134 Pro Cys Pro Pro Glu Val Asp Pro Ile Glu Ser His Thr Phe Ala Thr
135 355 360 365
136 Ser Thr Leu Thr Gln Phe Cys Ile Leu Phe Lys Arg Thr Phe Leu Ser
137 370 375 380
138 Ile Leu Arg Asp Thr Val Leu Thr His Leu Arg Phe Met Ser His Val
139 385 390 395 400
140 Val Ile Gly Val Leu Ile Gly Leu Leu Tyr Leu His Ile Gly Asp Asp
141 405 410 415
142 Ala Ser Lys Val Phe Asn Asn Thr Gly Cys Leu Phe Phe Ser Met Leu
143 420 425 430
144 Phe Leu Met Phe Ala Ala Leu Met Pro Thr Val Leu Thr Phe Pro Leu
145 435 440 445
146 Glu Met Ala Val Phe Met Arg Glu His Leu Asn Tyr Trp Tyr Ser Leu
147 450 455 460
148 Lys Ala Tyr Tyr Leu Ala Lys Thr Met Ala Asp Val Pro Phe Gln Val
149 465 470 475 480
150 Val Cys Pro Val Val Tyr Cys Ser Ile Val Tyr Trp Met Thr Gly Gln
151 485 490 495
152 Pro Ala Glu Thr Ser Arg Phe Leu Leu Phe Ser Ala Leu Ala Thr Ala
153 500 505 510
154 Thr Ala Leu Val Ala Gln Ser Leu Gly Leu Leu Ile Gly Ala Ala Ser
155 515 520 525
156 Asn Ser Leu Gln Val Ala Thr Phe Val Gly Pro Val Thr Ala Ile Pro
157 530 535 540
158 Val Leu Leu Phe Ser Gly Phe Phe Val Ser Phe Lys Thr Ile Pro Thr
159 545 550 555 560
160 Tyr Leu Gln Trp Ser Ser Tyr Leu Ser Tyr Val Arg Tyr Gly Phe Glu
161 565 570 575
162 Gly Val Ile Leu Thr Ile Tyr Gly Met Glu Arg Gly Asp Leu Thr Cys
163 580 585 590
164 Leu Glu Glu Arg Cys Pro Phe Arg Glu Pro Gln Ser Ile Leu Arg Ala
165 595 600 605
166 Leu Asp Val Glu Asp Ala Lys Leu Tyr Met Asp Phe Leu Val Leu Gly
167 610 615 620
168 Ile Phe Phe Leu Ala Leu Arg Leu Leu Ala Tyr Leu Val Leu Arg Tyr
169 625 630 635 640
170 Arg Val Lys Ser Glu Arg
171 645

174 <210> SEQ ID NO: 3 

175 <211> LENGTH: 1941 

176 <212> TYPE: DNA 

177 <213> ORGANISM: Homo sapiens 

179 <400> SEQUENCE: 3

180 atggcggaga aggcgctgga ggccgtgggc tgtggactag ggccgggggc tgtggccatg 60 

181 gccgtgacgc tggaggacgg ggcggaaccc cctgtgctga ccacgcacct gaagaaggtg 120 

182 gagaaccaca tcactgaagc ccagcgcttc tcccacctgc ccaagcgctc agccgtggac 180 

183 atcgagttcg tggagctgtc ctattccgtg cgggaggggc cctgctggcg caaaaggggt 240 

184 tataagaccc ttctcaagtg cctctcaggt aaattctgcc gccgggagct gattggcatc 300 

185 atgggcccct caggggctgg caagtctaca ttcatgaaca tcttggcagg atacagggag 360 

186 tctggaatga aggggcagat cctggttaat ggaaggccac gggagctgag gaccttccgc 420 

187 aagatgtcct gctacatcat gcaagatgac atgctgctgc cgcacctcac ggtgttggaa 480 

188 gccatgatgg tctctgctaa cctgaatctt actgagaatc ccgatgtgaa aaacgatctc 540 

189 gtgacagaga tcctgacggc actgggcctg atgtcgtgct cccacacgag gacagccctg 600 

190 ctctctggcg ggcagaggaa gcgtctggcc atcgccctgg agctggtcaa caacccgcct 660 

191 gtcatgttct ttgatgagcc caccagtggt ctggatagcg cctcttgttt ccaagtggtg 720 

192 tccctcatga agtccctggc acaggggggc cgtaccatca tctgcaccat ccaccagccc 780 

193 agtgccaagc tctttgagat gtttgacaag ctctacatcc tgagccaggg tcagtgcatc 840 

194 ttcaaaggag tggtcaccaa cctgatcccc tatctaaagg gactcggctt gcattgcccc 900 

195 acctaccaca acccggctga cttcatcatc gaggtggcct ctggcgagta tggagacctg 960 

196 aaccccatgt tgttcagggc tgtgcagaat gggctgtgcg ctatggctga gaagaagagc 1020 

197 agccctgaga agaacgaggt ccctgcccca tgccctcctt gtcctccgga agtggatccc 1080 

198 attgaaagcc acacctttgc caccagcacc ctcacacagt tctgcatcct cttcaagagg 1140 

199 accttcctgt ccatcctcag ggacacggtc ctgacccacc tacggttcat gtcccacgtg 1200 

200 gttattggcg tgctcatcgg cctcctctac ctgcatattg gcgacgatgc cagcaaggtc 1260 

201 ttcaacaaca ccggctgcct cttcttctcc atgctgttcc tcatgttcgc cgccctcatg 1320 

202 ccaactgtgc tcaccttccc cttagagatg gcggtcttca tgagggagca cctcaactac 1380 

203 tggtacagcc tcaaagcgta ttacctggcc aagaccatgg ctgacgtgcc ctttcaggtg 1440 

204 gtgtgtccgg tggtctactg cagcattgtg tactggatga cgggccagcc cgctgagacc 1500 

205 agccgcttcc tgctcttctc agccctggcc accgccaccg ccttggtggc ccaatctttg 1560 

206 gggctgctga tcggagctgc ttccaactcc ctacaggtgg ccacttttgt gggcccagtt 1620 
207 accgccatcc ctgtcctctt gttctccggc ttctttgtca gcttcaagac catccccact 1680 

208 tacctgcaat ggagctccta tctctcctat gtcaggtatg gctttgaggg tgtgatcctg 1740 

209 acgatctatg gcatggagcg aggagacctg acatgtttag aggaacgctg cccgttccgg 1800 

210 gagccacaga gcatcctccg agcgctggat gtggaggatg ccaagctcta catggacttc 1860 

211 ctggtcttgg gcatcttctt cctagccctg cggctgctgg cctaccttgt gctgcgttac 1920 

212 cgggtcaagt cagagagata g 1941

214 <210> SEQ ID NO: 4 

215 <211> LENGTH: 674 

216 <212> TYPE: PRT 

217 <213> ORGANISM: Homo sapiens 

219 <400> SEQUENCE: 4 

220 Met Ala Ala Phe Ser Val Gly Thr Ala Met Asn Ala Ser Ser Tyr Ser
221 1 5 10 15
222 Ala Glu Met Thr Glu Pro Lys Ser Val Cys Val Ser Val Asp Glu Val
223 20 25 30
224 Val Ser Ser Asn Met Glu Ala Thr Glu Thr Asp Leu Leu Asn Gly His
225 35 40 45
226 Leu Lys Lys Val Asp Asn Asn Leu Thr Glu Ala Gln Arg Phe Ser Ser
227 50 55 60
228 Leu Pro Arg Arg Ala Ala Val Asn Ile Glu Phe Arg Asp Leu Ser Tyr
229 65 70 75 80
230 Ser Val Pro Glu Gly Pro Trp Trp Arg Lys Lys Gly Tyr Lys Thr Leu
231 85 90 95
232 Leu Lys Gly Ile Ser Gly Lys Phe Asn Ser Gly Glu Leu Val Ala Ile
233 100 105 110
234 Met Gly Pro Ser Gly Ala Gly Lys Ser Thr Leu Met Asn Ile Leu Ala
235 115 120 125
236 Gly Tyr Arg Glu Thr Gly Met Lys Gly Ala Val Leu Ile Asn Gly Leu
237 130 135 140
238 Pro Arg Asp Leu Arg Cys Phe Arg Lys Val Ser Cys Tyr Ile Met Gln
239 145 150 155 160
240 Asp Asp Met Leu Leu Pro His Leu Thr Val Gln Glu Ala Met Met Val
241 165 170 175
242 Ser Ala His Leu Lys Leu Gln Glu Lys Asp Glu Gly Arg Arg Glu Met
243 180 185 190
244 Val Lys Glu Ile Leu Thr Ala Leu Gly Leu Leu Ser Cys Ala Asn Thr
245 195 200 205
246 Arg Thr Gly Ser Leu Ser Gly Gly Gln Arg Lys Arg Leu Ala Ile Ala
247 210 215 220
248 Leu Glu Leu Val Asn Asn Pro Pro Val Met Phe Phe Asp Glu Pro Thr
249 225 230 235 240
250 Ser Gly Leu Asp Ser Ala Ser Cys Phe Gln Val Val Ser Leu Met Lys
251 245 250 255
252 Gly Leu Ala Gln Gly Gly Arg Ser Ile Ile Cys Thr Ile His Gln Pro
253 260 265 270
254 Ser Ala Lys Leu Phe Glu Leu Phe Asp Gln Leu Tyr Val Leu Ser Gln
255 275 280 285
256 Gly Gln Cys Val Tyr Arg Gly Lys Val Cys Asn Leu Val Pro Tyr Leu
257 290 295 300






RAW SEQUENCE LISTING ERROR SUMMARY DATE: 10/08/2003 

PATENT APPLICATION: US/10/090,455A TIME: 14:27:59

Input Set : E:\406.app.txt 

Output Set: N:\CRF4\10082003\J090455A.raw 

Please Note: 

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence which presents at least one n or Xaa.

Seq#:13; Xaa Pos. 579,598 
VERIFICATION SUMMARY DATE: 10/08/2003

PATENT APPLICATION: US/10/090,455A TIME: 14:27:59

Input Set : E:\406.app.txt 

Output Set: N:\CRF4\10082003\J090455A.raw 

L:836 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 after pos.:576
M:341 Repeated in SeqNo=13